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In this paper we investigate error-thresholds on dynamics fitness-landscapes. We show that there 
exists both lower and an upper threshold, representing limits to the copying fidelity of simple 
replicators. The lower bound can be expressed as a correction term to the error-threshold present 
on a static landscape. The upper error-threshold is a new limit that only exists on dynamic fitness- 
landscapes. We also show that for long genomes on highly dynamic fitness-landscapes there exists a 
lower bound on the selection pressure needed to enable effective selection of genomes with superior 
fitness independent of mutation rates, i.e., there are distinct limits to the evolutionary parameters 
in dynamic environments. 



Ever since Eigen's work on replicating molecules in 
1971 EL the concept of quasi-species has proven to be a 
very fruitful way of modeling the fundamental behavior 
of evolution. A quasi-species is an equilibrium distribu- 
tion of closely related gene sequences, localized around 
one or a few sequences with high htness. The combi- 
nation of simplicity and mathematical preciseness makes 
it possible to isolate the effects of different fundamental 
parameters in the model. It also makes it possible to 
capture some general phenomena in nature, such as the 
critical relation between mutation rate and information 
transmission . The kinetics of these simple systems 
has been studied in great detail as the formulation has 
allowed many of the techniques of statistical physics to 
be applied to replicator and evolutionary systems. See 
for instance [H-flfl. 



The appearance in these models of an error-threshold 
(or error-catastrophy) as an upper bound on the muta- 
tion rate, above which no effective selection can occur, 
has important implications for biological systems. In par- 
ticular it places limits on the maintainable amounts of ge- 
netic information p]fl[16[ which puts strong restrictions 
on possible theories for the origins of life. It is interest- 
ing to note that some RNA-viruses seem to have evolved 
mutation rates that are close to the error-threshold Q 

Studies of quasi-species until now have focused on 
static fitness-landscapes. Many organisms in nature 
however live in a quickly changing environment [ JL7| . 
This is especially important for viruses and other mi- 
crobial pathogens that must survive in a host with an 
highly dynamic immune system for which there only ex- 
ist tight and temporary niches with high fitness (for the 
pathogen). 

In this paper we investigate how the critical mutation 
rate of the error threshold is affected by a dynamical 
fitness-landscape. We show how the critical mutation 



rate is lowered by shifts of the fitness-peak. An sim- 
ple analytical expression for this critical copying fidelity 
is also presented. It also turns out that if the selection 
pressure is too small, the fitness-landscape moves too fast 
and the fitness encoding genome is too large, the popu- 
lation will lose the fitness-peak independent of mutation 
rate. This shows the existence of regions in parameter 
space where no selection can occur despite possibilities 
of adjusting copying-fidelity. 

In brief a quasi-species consists of a population of self- 
replicating genomes represented by a sequence of bases 
Sk, {s\S2 ■ ■ ■ s n ). Hereafter we will assume binary bases 
{1,0} and that all sequences have equal length n though 
these restrictions are easily relaxed. Every genome is 
then given by a binary string (011001 • ■ •), which can be 
represented by an integer k (0 < k < 2"). 

To describe how mutations affect a population we de- 
fine Wj. as the probability that replication of genome I 
gives genome k as offspring. For perfect copying accu- 
racy, Wj. equals the identity matrix. Mutations however 
give rise to off diagonal elements in Wj. . Since the genome 
length is fixed to n we will only consider point mutations, 
which conserve the genome length. 

We assume that the point mutation rate p = 1 — q 
(where q is the copying accuracy per base) is constant in 
time and independent of position in the genome. We can 
then write an explicit expression for W\ in terms of the 
copying fidelity: 



W l =p h Mq n-h M 
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where h^i is the Hamming distance between genomes k 
and I, and n is the genome length. The Hamming dis- 
tance hki is defined as the number of positions where 
genomes k and I differ. 

The equations describing the dynamics of the popula- 
tion now take a relatively simple form. Let Xk denote 
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the relative concentration and Ak the fitness of genome 
k. We then obtain the rate equations: 
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1X1 - exk 
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where e = J2i M x l an d the dot denotes a time derivative. 
The second term ensures the total normalization of the 
population (as xi — 1) so that Xk describes relative 
concentrations . 

To create a dynamic landscape we consider a single 
peaked fitness landscape Q whose peak moves, result- 
ing in different optimal gene sequences at different times. 
Formally we can write Ak(t) — & and Ai = 1 V7 ^ k(t) 
where the (changing) genome k(t) describes how the peak 
moves through sequence space. If k(t) is constant in 
time the rate equation [Eq. ^] corresponds to the classi- 
cal (static) theory of quasi-species studied by Eigen and 
others. 

We allow the peak in the fitness landscape to move 
to one of its closest neighbors (chosen ranomly) . In this 
paper we assume that movements occur with a fixed fre- 
quency but one could also consider a probabilistic move- 
ment. 

The mutation matrix W describes point mutations 
which occurr with equal probability independent of po- 
sition in the genome. This imposes a symmetry on the 
rate equations, dividing the relative concentrations into 
error classes Ti described by their Hamming distance i 
from the master sequence (r ). This reduces the effec- 
tive dimension of the sequence space from 2™ to n + 1 
thereby making the problem analytically tractable. The 
use of assymctric evolution operators (such as recombina- 
tion) or fitness landscapes is obviously significantly more 
problematic and is the subject of ongoing work. When 
the fitness peak moves this landscape symmetry will be 
broken since one sequence in Ti will be singled out as 
the new master sequence. This would only affect the re- 
sults we present below if the mean time between shifts 
in the fitness-landscape was small — as there would then 
be a substantial concentration of the old master sequence 
present when the peak moves back into this error-class. 
We assume the dynamics to be slow enough for this not 
to be a problem. 

Moving the fitness peak then corresponds to applying 
the following co-ordinate transformation to the concen- 
tration vector: 
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To study the population dynamics we may divide the dy- 
namics into cycles from time to r, where r is a parame- 
ter determining the number of generations between shifts 
of the fitness peak when the evolution proceeds as for a 
static landscape. We then apply the R transformation 



to the concentration vector. The resulting concentration 
distribution is used as the initial condition for the rate 
equations from time r to 2r and so on. These popula- 
tion dynamics JEq. ||and R] may be solved numerically as 
shown in Fig. [j] (after the initial transient) where r = 5, 
a = 10, q = 0.999 and string-length n = 50. 
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FIG. 1. This plot shows the population dynamics for gene 
sequences of length 50 when the fitness peak moves every fifth 
generation (r = 5) and the height of the fitness peak a is 10. 
The result is shown from t = 25 to remove initial transients. 

A simple approximation of the model presented above 
enables us to derive analytical expressions for the error- 
thresholds on a dynamic fitness-landscape. Neglecting 
back mutations into the master-sequence, we can write 
the rate equation for the master-sequence on a static 
fitness-landscape as 



Q<7X r , 



(4) 



where Q = q n is the copying fidelity of the whole genome 
and e = ux mas + 1 — x mas . The asymptotic concentration 
of master-sequences is 
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This implies that the error-threshold on a static fitness- 
landscape occurs when 
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(see e.g., KL§|)- This result is also intuitively clear 
since the superior fitness (and hence growth rate) of the 
master-sequence must compensate for the loss of Tq indi- 
viduals due to mutations that occur during replication. 

The intuitive picture of the error-threshold on a dy- 
namic fitness-landscape is different: what determines the 
critical mutation rate is whether the master-sequence will 
have time to regrow between the shifts of the fitness- 
peak. To find an analytical approximation for the error- 
treshold we have to expand Eq. ([I]) to include the dynam- 
ics of error-class one as well as the master-sequence. This 
is necessary since the fitness-peak moves into error-class 
one every r time-steps. We can, however, make a large 
simplification by assuming the growth of the master- 
sequence to be in the exponential regime, i.e., that we can 
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neglect the non-linearity in Eq. (Q). This is a good ap- 
proximation near the error-threshold as, for these values 
of q, the master-sequence will not have time to approach 
any kind of equilibrium before the peak shifts again. We 
can thus write an approximation of the rate equations 
for the master-sequence and a representative member of 
error-class one: 



Xmas — 1) ^mos 

iij = Q<jx mas + (Q- l)xtj 
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The shifting involves the move of the fitness peak to 
one of the sequences in error-class one at time t = r. 
The initial concentration of master-sequences at the be- 
ginning of a shift cycle is therefore x mas (0) = x\ (r). 
If the concentration of the master-sequence after the 
shift is lower than immediately after the previous shift, 
i.e. x m as (0) > x% (r), the distribution of concentra- 
tions will converge towards a uniform distribution. This 
is, in effect, a definition of the error-threshold. A con- 
dition for effective selection is then given by inserting 
x m as (0) < x\ (r) into Eq. (||). We then derive a master- 
sequence growth parameter 
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It is not possible to find exact analytical solutions 
for the roots of Eq. (||) and hence the error-thresholds. 
Fig. ^ shows the region where Eq. (||) can be expected to 
hold. The figure also shows the existence of two error- 
thresholds, qf^ n and q^ n corresponding to the real roots 
of k = 1. The lower threshold is a new version of 
the static error-threshold, with a pertubation resulting 
from the movement of the fitness-landscape. The upper 
threshold is a new phenomenon that appears only on dy- 
namic fitness-landscapes. Its existence is intuitively clear 
— if the mutation rate is very close to zero, there will not 
be enough individuals present on the new peak position 
when the shift occurrs to maintain a steady occupancy of 
the master sequence, i.e. the peak moves out from under 
the quasi-species and the population will not be able to 
track shifts in the fitness-landscape. 



dyn 



dyn 



0.994 0.996 

q 



where mutations into the member of error-class one are 
neglected and Q = (1 — q) q 11 ^ 1 describes mutation from 

Xmas 

into xij. We now assume Xij (0) = 0, which is a 
good approximation since x±j is (almost always) in T2 be- 
fore the shift. The solutions to Eq. (0) using this bound- 
ary condition can be written as 

T (t) = X (0) p(9" cr - 1 ) t 

•^mas \ L ) — ^mas V u 7 c 

Xu(t) =Xmas(0) ^ , 7T M ^ (8) 



FIG. 2. The left hand side of Eq. |^ is plotted as a function 
of the copying fidelity q. The genome length n = 50, r = 2 
and cr = 5. The lower threshold is located at a,Y n = 0.988 



and the upper threshold at g^ n = 0.9997 



Analytical approximations to the error-thresholds can 
be found by assuming different dominant terms in the 
two different regions. To find the lower threshold q, yn 
we asssume </' 
gives 



to dominate the behavior. Solving for q r ' 
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We can use Eq. (|10|) to find a first order correction in r 
to the static threshold by putting q — -^tp: on the right 
hand side 
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(11) 



where we also made the approximation w 1. This is 
an expression for the lower error-threshold on a dynamic 
fitness-landscape. Note that Q^ n — * Qcrit when r 
i.e. we recover the stationary landscape limit. 
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FIG. 3. The mean fitness is plotted as a function of the 
copying fidelity per base q. The fitness peak moves every 
other generation (r = 2), the string-length n is 50 and the 
growth superority of the master sequence a is 10. The er- 
ror-threshold occurs at the predicted value qf% n = 0.973. The 
static error-threshold is located at q atat — 0.955. 

Fig. U shows the mean fitness of a population as a func- 
tion of the copying-fidelity. When q is below qf^ n , the 
concentration of master-sequences is approximately zero 
and the mean fitness will therefore be 1. The figure is 
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based on numerical simulations of the full rate equations 
[Eq. Note that the predicted value of qf^ n given by 
Eq. is quite accurate. Further comparisons to nu- 

merical solutions to the full dynamics are shown in ta- 
ble | 

Both the qualitative and quantitative dynamics of both 
error thresholds have been verified by computer simula- 
tions using large populations to approximate the deter- 
ministic dynamics. 

The critical copying fidelity QiY 1 depends on the 
genome-length. This is not surprising since the fitness- 
peak shifts into a specific member of Ti, which consists 
of n different gene-sequences. It is, however, a direct 
consequence of the dynamic fitness-landscape since the 
static error-threshold is independent of genome-length. 
This effect is demonstrated in Fig. |], where Qi„ n ver- 
sus the genome-length is plotted. The perturbation from 
the static error-threshold increases with genome-length. 
The derivative is however decreasing and for reasonable 
values of t 3> 1 and a ^> 1 the static and dynamic error- 
threshold are of the same order of magnitude and show 
the same scaling behaviour. 
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FIG. 4. The plot shows how the error-threshold Q t ^ n given 
by Eq. ( |ll| ) depends on the genome- length. The parame- 
ters describing the fitness-landscape are fixed, a = 10 and 
r = 5. The static error-threshold is located co-incident with 
the x-axis at Q stat = 0.1. 
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TABLE I. The table shows results of numerical solutions 
of the error-threshold compared to predicted values given by 
Eq. (|ll]) and the threshold for the corresponding static fit- 
ness-landscape. 



An analytical approximation to the new upper thresh- 
old can be found by assuming q to be very close to 1 
and therefore the (1 — g)-term dominates the behaviour 
of Eq. (||). Again assuming a 3> 1 and putting q n = 1, 
gives 
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Explicit numerical solutions of the full dynamics con- 
firm that this threshold exsists and is predicted by 
Eq. (p^). For most values of a and r, q^ n is very close to 
1 (e.g. (<r — 1) r = 50 gives 10 -22 as a lower bound on the 
mutation rate per base pair). Finite population affects 
are however significant for the upper error-threshold. In 
real biological populations this may be imposrtant. More 
detailed studies of these issues arc under preparation. 

It is important to note that qf^ 71 is independent of 
the genome-length. The total copying fidelity Qf^ n = 
(g^i" 1 ) will then depend strongly on the genome-length. 

This means that as the genome-length increases, the 
evolvable gap in between the two error-thresholds nar- 
rows. 

On a static fitness-landscape it is always possible to 
find copying fidelities high enough for evolution to be 
effective. It turns out that this is no longer the case 
for dynamic fitness-landscapes. There exist regions in 
parameter-space (spanned by a, r and n) where solutions 
to Eq. (||) cease to exist. This happens when the upper 
and lower error-thresholds coincide or, to put it differ- 
ently, when the maximum (taken over q) of the left hand 
side of Eq. (|^) become less than 1. To find this conver- 
gence point it is better to search for a direct approxima- 
tion of q that maximizes the left hand side of Eq. (^|) as 
the approximations for upper and lower error-thresholds 
given above become less accurate when they are close to- 
gether, To do this we assume the leading behaviour is 
determined by the factor e^ q <T ~ 1 ) T (1 — q). Taking the 
derivative of this expression and setting it to zero gives 
the equation g" -1 (1 — q) = — =— . Assuming q to be very 
close to 1, and hence q n ~ 1 w 1 gives 
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TABLE II. The minimum selection pressure required for 
an infinite population to track the peak is listed for different 
values of the genome length n and the number of generations 
between shifts of the fitness-peak r. 
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qmax » 1 (13) 

am 

This approximation for (; maa ; can be substituted into 
Eq. (||). It is easy find points in phase space where this 
inequality starts to hold by fixing two parameters (e.g., 
r and n) and then numerically solving for the third (a). 
Table O shows the minimal height of the fitness-peak for 
different values of r and n. The required selective pres- 
sure becomes large for fast moving fitness-landscapes and 
large genome lengths. 

In conclusion we have shown existence of, and derived 
analytic expressions for, two error-thresholds on a simple 
dynamic fitness-landscape. The lower threshold is a per- 
turbation of the well known error-catastrophy that exists 
a static fitness-landscape that accounts for the destabi- 
lizing effect of the changing environment. The existence 
of an upper bound on the copying fidelity is a new phe- 
nomenon, only existing in dynamic environments. The 
presence of this upper bound results in the existence of 
critical regions of the landscape parameters (a, r and n) 
where the two thresholds coincide (or cross) and threre- 
fore no effective selection can occur. Thus dynamics land- 
scapes have strong constraints on evolvability. 
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